Applications of Finite-State Transducers in Natural Language Processing
Abstract
This paper is a review of some of the major applications of finite-state transducers in natural-language processing, ranging from morphological analysis to finite-state parsing. The analysis and generation of inflected word forms can be performed efficiently by means of lexical transducers. Such transducers can be compiled using an extended regular-expression calculus with restriction and replacement operators. These operators facilitate the description of complex linguistic phenomena involving morphological alternations and syntactic patterns. Because regular languages and relations can be encoded as finite automata, new languages and relations can be derived from them directly by the finite-state calculus. This is a fundamental advantage over higher-level linguistic formalisms.
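To make the idea of a lexical transducer concrete, the following is a minimal sketch in Python, not the compiled and minimized networks the paper refers to. It stores a handful of paths that pair a lexical string (lemma plus tags) with a surface form, and runs the same network in both directions. The class name `Transducer`, the example words, and the tag symbols `+N`, `+Sg`, `+Pl`, `+V`, `+3sg` are assumptions chosen for illustration.

```python
from collections import defaultdict

EPS = ""  # epsilon: the empty symbol (illustrative convention)


class Transducer:
    """Toy path-shaped transducer; arcs pair an upper and a lower symbol."""

    def __init__(self):
        self.arcs = defaultdict(list)  # state -> [(upper, lower, next_state)]
        self.finals = set()
        self.n_states = 1  # state 0 is the start state

    def add_pair(self, upper, lower):
        """Add one path mapping the symbol list `upper` to `lower`.

        The shorter side is padded with epsilon at the end.
        """
        state = 0
        for i in range(max(len(upper), len(lower))):
            u = upper[i] if i < len(upper) else EPS
            l = lower[i] if i < len(lower) else EPS
            nxt = self.n_states
            self.n_states += 1
            self.arcs[state].append((u, l, nxt))
            state = nxt
        self.finals.add(state)

    def _apply(self, symbols, side):
        """Match `symbols` on one side (0 = upper, 1 = lower); collect the other."""
        results = set()
        stack = [(0, 0, ())]  # (state, input position, output so far)
        while stack:
            state, pos, out = stack.pop()
            if pos == len(symbols) and state in self.finals:
                results.add("".join(out))
            for arc in self.arcs[state]:
                inp, outp, nxt = arc[side], arc[1 - side], arc[2]
                if inp == EPS:
                    # Epsilon on the input side consumes nothing.
                    stack.append((nxt, pos, out + (outp,)))
                elif pos < len(symbols) and symbols[pos] == inp:
                    stack.append((nxt, pos + 1, out + (outp,)))
        return results

    def generate(self, lexical_symbols):
        """Lexical form (list of symbols) -> set of surface strings."""
        return self._apply(lexical_symbols, 0)

    def analyze(self, surface):
        """Surface string -> set of lexical analyses."""
        return self._apply(list(surface), 1)


# A three-entry lexicon, invented for this example.
lexicon = Transducer()
lexicon.add_pair(list("leaf") + ["+N", "+Sg"], list("leaf"))
lexicon.add_pair(list("leaf") + ["+N", "+Pl"], list("leaves"))
lexicon.add_pair(list("leave") + ["+V", "+3sg"], list("leaves"))

print(sorted(lexicon.analyze("leaves")))
print(lexicon.generate(list("leaf") + ["+N", "+Pl"]))
```

Analysis of the ambiguous surface form "leaves" returns both the noun and the verb reading, while generation from the plural noun analysis returns the single surface form. A compiled lexical transducer would share prefixes and suffixes between paths and derive the alternations from rules, which this sketch does not attempt.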
Similar resources
Finite-State Transducers in Language and Speech Processing
Finite-state machines have been used in various domains of natural language processing. We consider here the use of a type of transducer that supports very efficient programs: sequential transducers. We recall classical theorems and give new ones characterizing sequential string-to-string transducers. Transducers that output weights also play an important role in language and speech processing....
Finite State Transducers with Predicates and Identities
An extension to finite state transducers is presented, in which atomic symbols are replaced by arbitrary predicates over symbols. The extension is motivated by applications in natural language processing (but may be more widely applicable) as well as by the observation that transducers with predicates generally have fewer states and fewer transitions. Although the extension is fairly trivial fo...
On Some Applications of Finite-State Automata Theory to Natural Language
We describe new applications of the theory of automata to natural language processing: the representation of very large scale dictionaries and the indexation of natural language texts. They are based on new algorithms that we introduce and describe in detail. In particular, we give pseudocode for the determinization of string-to-string transducers, the deterministic union of p-subsequential s...
1 Formal Language Theory
This chapter provides a gentle introduction to formal language theory, aimed at readers with little background in formal systems. The motivation is natural language processing (NLP), and the presentation is geared towards NLP applications, with linguistically motivated examples, but without compromising mathematical rigor. The text covers elementary formal language theory, including: regular la...
Finite-State Transducers
Finite-state transducers play an important role in natural language processing. They provide a model for text and speech, as well as transformations of them. In order to reduce the complexity of their application, deterministic and minimal transducers are required. This essay introduces a particular type of finite-state transducers, subsequential string-to-weight transducers, and presents algor...